An algebraic semigroup method for discovering maximal frequent itemsets

نویسندگان

چکیده

Abstract Discovering maximal frequent itemsets is an important issue and key technique in many data mining problems such as association rule mining. In the literature, generating proves either to be NP-hard or have O ( l 3 4 m + n ) O\left({l}^{3}{4}^{l}\left(m+n)) complexity worst case from perspective of complete bipartite graphs a graph, where m , n are item number transaction number, respectively, l denotes maximum ∣ C Ψ / − 1 | C| \Psi \left(C)| \hspace{0.1em}\text{/}\hspace{0.1em}\left(| +| -1) with taken over all C . this article, we put forward method for discovering itemsets, whose 2 β O\left(3mn{2}^{\beta }+{4}^{\beta }n) lower than known both case, semigroup algebra, \beta items support more minimum threshold. Experiments also show that algorithm based on algebraic performs better other three well-known algorithms. Meanwhile, explore some properties respect transactions, prove exactly simplified generators give necessary sufficient condition i i+1 -frequent itemset being subset closed i itemset, provide recurrence formula itemsets.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Discovering Frequent Closed Itemsets for Association Rules

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...

متن کامل

Efficiently Mining Maximal Frequent Itemsets

We present GenMax, a backtrack search based algorithm for mining maximal frequent itemsets. GenMax uses a number of optimizations to prune the search space. It uses a novel technique called progressive focusing to perform maximality checking, and diffset propagation to perform fast frequency computation. Systematic experimental comparison with previous work indicates that different methods have...

متن کامل

A comprehensive method for discovering the maximal frequent set

The association rule mining can be divided into two steps.The first step is to find out all frequent itemsets, whose occurrences are greater than or equal to the user-specified threshold.The second step is to generate reliable association rules based on all frequent itemsets found in the first step. Identifying all frequent itemsets in a large database dominates the overall performance in the a...

متن کامل

Maximal Frequent Itemsets Mining Using Database Encoding

Frequent itemsets mining is a classic problem in data mining and plays an important role in data mining research for over a decade. However, the mining of the all frequent itemsets will lead to a massive number of itemsets. Fortunately, this problem can be reduced to the mining of maximal frequent itemsets. In this paper, we propose a new method for mining maximal frequent itemsets. Our method ...

متن کامل

High Performance Mining of Maximal Frequent Itemsets

Mining frequent itemsets is instrumental for mining association rules, correlations, multi-dimensional patterns, etc. Most existing work focuses on mining all frequent itemsets. However, since any subset of a frequent set also is frequent, it is sufficient to mine only the set of maximal frequent itemsets. In this paper, we study the performance of two existing approaches, Genmax and Mafia, for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Open Mathematics

سال: 2022

ISSN: ['2391-5455']

DOI: https://doi.org/10.1515/math-2022-0516